SWORD - a highly efficient protein database search
نویسندگان
چکیده
MOTIVATION Protein database search is one of the fundamental problems in bioinformatics. For decades, it has been explored and solved using different exact and heuristic approaches. However, exponential growth of data in recent years has brought significant challenges in improving already existing algorithms. BLAST has been the most successful tool for protein database search, but is also becoming a bottleneck in many applications. Due to that, many different approaches have been developed to complement or replace it. In this article, we present SWORD, an efficient protein database search implementation that runs 8-16 times faster than BLAST in the sensitive mode and up to 68 times faster in the fast and less accurate mode. It is designed to be used in nearly all database search environments, but is especially suitable for large databases. Its sensitivity exceeds that of BLAST for majority of input datasets and provides guaranteed optimal alignments. AVAILABILITY AND IMPLEMENTATION Sword is freely available for download from https://github.com/rvaser/sword CONTACT [email protected] and [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
منابع مشابه
An Effective Path-aware Approach for Keyword Search over Data Graphs
Abstract—Keyword Search is known as a user-friendly alternative for structured languages to retrieve information from graph-structured data. Efficient retrieving of relevant answers to a keyword query and effective ranking of these answers according to their relevance are two main challenges in the keyword search over graph-structured data. In this paper, a novel scoring function is proposed, w...
متن کاملA Recyclable Poly(ionic liquid)s Enzyme Reactor for Highly Efficient Protein Digestion
One of the most significant tasks for proteomic research and industrial applications, is the preparation of recyclable enzyme reactor. Herein, a novel recyclable enzyme reactor has been developed based on monodispersed spherical poly(quaternary ammonium ionic liquid)s particles immobilized trypsin. A new quaternary ammonium ionic liquids functional monomer was first synthesized. The ionic l...
متن کاملHighly Efficient Transfection of Dendritic Cells Derived from Esophageal Squamous Cell Carcinoma Patient: Optimization with Green Fluorescent Protein and Validation with Tumor RNA as a Tool for Immuno-genetherapy
This study was conducted to optimize a highly efficient mRNA transfection into dendritic cells (DC) derived from esophageal squamous cell carcinoma (ESCC) patients. Applying an electroporation technique, in vitro synthesized Green Fluorescent Protein (GFP) mRNA was transfected as an indicator into the DCs derived from a healthy donor. Flow cytometry revealed 84.9% transfection efficiency for DC...
متن کاملGeometric Suffix Tree: A New Index Structure for Protein 3-D Structures
Protein structure analysis is one of the most important research issues in the post-genomic era, and faster and more accurate query data structures for such 3-D structures are highly desired for research on proteins. This paper proposes a new data structure for indexing protein 3-D structures. For strings, there are many efficient indexing structures such as suffix trees, but it has been consid...
متن کاملA Query-oriented XML Fragment Search Approach on A Relational Database System
In this paper, we propose a query-oriented fragment search approach for efficient and effective XML search engines on relational database systems. Conventional approaches for XML fragment search have only considered statistics of XML documents stored in a database system to rank a search result. As a result, almost all XML fragment search engines have attained negative results in the research f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 32 17 شماره
صفحات -
تاریخ انتشار 2016